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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Crabtree, Gerald R, 
Schreiber, Stuart L, 
Spencer, David M. 
Wandless, Thomas J. 
Belshaw, Peter 

(ii) TITLE OF INVENTION: REGULATED TRANSCRIPTION OF TARGETED 
Ui) ^^^^g ^ OTHER BIOLOGICAL EVENTS 

(iii) NUMBER OF SEQUENCES: 81 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: ARIAD Pharmaceuticals, Inc. 

(B) STREET: 26 Landsdowne Street 

(C) CITY: Cambridge 

(D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02139 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
(B.) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC/DOS /MS/DOS 

(D) SOFTWARE: Patent In Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 08/478,386 

(B) FILING DATE: 07/JUN/1995 

(C) CLASSIFICATION: 435 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Figg, E. Anthony 

(B) REGISTRATION NUMBER: 27,195 

(C) REFERENCE/DOCKET NUMBER: 2054-114A 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (202) 783-6040 

(B) TELEFAX: (202) 783-6031 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

Met Gly Ser Ser Lys Ser Lys Pro Lys Asp Pro Ser Gin Arg 
1 5 * 10 



(2) INFORMATION FOR SEQ ID NO: 2; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 
GTTAAGTTAA C 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:3: 
TGACTCAGCG C' 

(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /note= "Sac II restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 12, .16 

(D) OTHER INFORMATION: /note= "Kozak sequence." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 31 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 17., 33 

(D) OTHER INFORMATION: /note= "Region of homology with 
target sequence," 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CGACACCGCG GCCACC ATG GCC ACA ATT GGA GC 

Met Ala Thr He Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 

Met Ala Thr He Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY; misc_feature 

(B) LOCATION: 6,-11 , . 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 
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(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 12 . • 27 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
CGACACTCGA GAGCCCATGA CTTCTGG 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids- 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..4 

(D) OTHER INFORMATION: /note= "Translation product of 
complement of SEQ ID NO: 6, bases 9 to 20," 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Ser Trp Ala Leu 
1 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6. .11 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 12 -.41 

(D) OTHER INFORMATION: /not6= "Region of homology with 
target sequence . " 
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(i>c) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 41 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 2 8 

(D) OTHER INFORMATION: /note= "A to G." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

CGACACTC GAG CTC TGC TAC TTG CTA GGT GGA ATC CTC TTC 41 
Glu Leu Cys Tyr Leu Leu Gly Gly lie Leu Phe 
1 5 10 



(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 

Glu Leu Cys Tyr Leu Leu Gly Gly He Leu Phe 
15 10 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 3., 8 ... 

(D) OTHER INFORMATION: /note= "Eco RI restrict lon . Site . 

(ix) FEATURE ; 

(A) NAME/KEY: Tnisc_f eature 

(B) LOCATION: 9 24 . 
(D) OTHER INFORMATION: /note= "Region of homology witu 

target sequence . " 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 24 

(D) OTHER INFORMATION: /note^^ "G to C." 
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(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: complement (9.-11) 

(D) OTHER INFORMATION: /note= "Trans lational stop encoded 
in complementary strand," 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GCGAATTCTT AGCGAGGGGC CAGC 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

ic) STRANDEDNESS : single 
(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1. .4 

(D) OTHER INFORMATION: /note= "Translat ional product of 
complement to SEQ ID NO:10, bases 12 to 23." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Leu Ala Pro Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

( i i ) MOLECULE TYPE : cDNA 

( ix) FEATURE : 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 3. .8 

(D) OTHER INFORMATION: /note= "Eco RI restriction," 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 12 . . 17 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: complement (9.. 11) 
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(D) OTHER INFORMATION: /note= "Translational stop signal 
encoded on complementary strand." 

. (ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 18,. 33 

(D) OTHER INFORMATION: /note=: "Region of homology with 
target sequence." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
GCGAATTCTT AGTCGACGCG AGGGGCCAGG GTC 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino, acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: peptide 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1..4 ^ ^ ^ 
(D) OTHER INFORMATION: /note= "Translat ional product of 

complement to SEQ ID N0:12, bases 18 to 29." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 

Leu Ala Pro Arg 
1 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 4.-9 . . 

(D) OTHER INFORMATION: /note= "Xho I restriction site. 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 13 

(D) OTHER INFORMATION: /note= "T to G." 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 4 . .25 . . ^ 
(D) OTHER INFORMATION: /not:e= "Region of homology with 

target sequence . " 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 10.. 24 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GGGCTCGAG CTC GGC TAC TTG CTA G 25 
Leu Gly Tyr Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Leu Gly Tyr Leu Leu 
1 5 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 , . 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 12.. 26 , 
(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGACACTCGA GGTGACGGAC AAGGTC 



26 
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(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: inisc_f eature 

(B) LOCATION: 12.. 26 

(D) OTHER INFORMATION: /note= "Region of homology with 
target sequence . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17; 
CGACAGTCGA CCCAATCAGG GACCTC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /note=: "Xho I restriction site," 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 10. .15 

(D) OTHER INFORMATION: /note= "Bsi WI restriction site." 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 6.. 32 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



TCGAG TAT CCG TAC GAC GTA CCA GAC TAC GCA G 
Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



33 
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(2) INFORMATION FOR SEQ ID NO; 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Tyr Pro Tyr Asp Val Pro Asp Tyr Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1.-5 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20; 
TCGACTGCGT AGTCTGGTAC GTCGTACGGA TAC 3 3 



(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TCGACTATCC GTACGACGTA CCAGACTACG CAC 



33 
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[2) INFORMATION FOR SEQ ID NO: 22; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1.-5 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TCGAGTGCGT AGTCTGGTAC GTCGTACGGA TAG 



33 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 80 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /note= "Sac II restriction site 

(ix) FEATURE: 

(A) NAME/KEY: misc__signal 

(B) LOCATION: 12 . .16 

(D) OTHER INFORMATION: /note= "Kozak secfuence . '» 

(ix) FEATURE: 

(A) NAME/KEY: tnisc^signal 

(B) LOCATION: 17.. 58 

(D) OTHER INFORMATION: /note= "Myristoylation signal." 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 59, .64 

(D) OTHER INFORMATION: /note= "Xho I restriction site. 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 65.. 80 

(D) OTHER INFORMATION: /note= "Zeta homology." 
(ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 17.-79 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CGACACCGCG GCCACC ATG GGG AGT AGC AAG AGC AAG CCT AAG GAC CCC 4 9 

Met Gly Ser Ser Lys Ser Lys Pro Lys Asp Pro 
15 10 

AGC CAG CGC CTC GAG AGG AGT GCA GAG ACT G 80 
Ser Gin Arg Leu Glu Arg Ser Ala Glu Thr 
15 20 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Met Gly Ser Ser Lys Ser Lys Pro Lys Asp Pro Ser Gin Arg Leu Glu 
1 5 10 . 15 

Arg Ser Ala Glu Thr 
20 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 12.. 26 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site. 

(ix) FEATURE: 

(A) NAME/KEY: mis cofeature 

(B) LOCATION: 12., 27 
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(D) OTHER INFORMATION: /note= "Region of homology with 
target sequence . " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CGACACTCGA G GAG CTC TGT GAC GAT G 
Glu Leu Cys Asp Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

Glu Leu Cys Asp Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY : misc_f eature 

(B) LOCATION: 12.. 41 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence . " 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 27, ,29 

(D) OTHER INFORMATION: /note= "GAT to AAG." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 41 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 




CGACACTC GAG CTC TGC TAC TTG CTA AAG GGA ATC CTC TTC 41 
Glu Leu Cys Tyr Leu Leu Lys Gly lie Leu Phe 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Glu Leu Cys Tyr Leu Leu Lys Gly lie Leu Phe 
15 10 



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single' 

(D) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE:- cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 

(ix) FEATURE: ' 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 44- 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 27.-44 

(D) OTHER INFORMATION: /note= "Region of homology with target 

sequence. " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 

CGACACTC GAG CTG CTG GAT CCG AAG CTC TGC TAC TTG CTA AAG 44 
Glu Leu Leu Asp Pro Lys Leu Cys Tyr Leu Leu Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 12 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 

Glu Leu Leu Asp Pro Lys Leu Cya Tyr Leu Leu Lys 
15 10 



(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 6 . . 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site. 

(ix) FEATURE: 

(A) NAME/KEY; Tnisc_f eature 

(B) LOCATION: 12.. 31 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence . " 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 9, .31 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

CGACACTC GAG ACA ACA GAG TAC CAG GTA GC 
Glu Thr Thr Glu Tyr Gin Val Ala 
1 5 



(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
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Glu Thr Thr Glu Tyr Gin Val Ala 
1 5 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 . , 11 

(D) OTHER INFORMATION: /note= "Xho I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 12.. 28 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sec[uence." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 9.. 28 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33-: 

CGACACTC GAG GGC GTG CAG GTG GAG AC 2 8 

Glu Gly Val Gin Val Glu Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:34: 

Glu Gly Val Gin Val Glu Thr 
1 5 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6 , . 11 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: misc__f eature 

(B) LOCATION: 12., 27 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence . " 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION; complement (9,. 26) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
CGACAGTCGA CTTCCAGTTT TAGAAGC 27 

(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 36: 

Leu Leu Lys Leu Glu Val 
1 5 

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7 . . 12 

(D) OTHER INFORMATION; /note= "Xho I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
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(B) LOCATION: 10,. 27 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 13 . .27 

(D) OTHER INFORMATION: /note= "Region of homology with 

target sequence . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

TCGACACTC GAG ACG GGG GCC GAG GGC 27 
Glu Thr Gly Ala Glu Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Glu Thr Gly Ala Glu Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 7 . , 12 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: complement (10.. 18) 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 13.. 28 

(D) OTHER INFORMATION: /note- "Region of homology with 

target sequence . " 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 
CCGACAGTCG ACCTCTATTT TGAGCAGC 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40 

He Glu Val 
1 



(2) INFORMATION FOR SEQ ID N0:41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
CGACACCGCG GCCACCATGA AGCTACTGTC TTCTATCG 



(2) INFORMATION FOR SEQ ID NO:42:. 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CGACAGTCGA CCGATACAGT CAACTGTC 



(2) INFORMATION FOR SEQ ID N0:43: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 6. .11 

(D) OTHER INFORMATION: /note= "Sac II restriction site." 

{ ix) FEATURE : 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 12. .16 

(D) OTHER INFORMATION: /note^ "Kozak sequence." 

( ix) FEATURE : 

(A) NAME/KEY: CDS 

(B) LOCATION: 17 . .37 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17 , .38 . 

(D) OTHER INFORMATION: /note= "Gal4 (1-147) coding region." 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 

CGACACCGCG GCCACC ATG AAG CTA CTG TCT TCT ATC G 3 8 

^Iet Lys Leu Leu Ser Ser lie 
1 5 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:44: 

Met Lys Leu Leu Ser Ser lie 
1 5 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 
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(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..17 , . n ^ 
(D) OTHER INFORMATION: /note= "Region encoding for C- terminal end 

of Gal4 (1-147) . " 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 17 

(ix) FEATURE: 

(A) NAME/KEY: misc^f eature 

(B) LOCATION: 18.. 23 . . 

(D) OTHER INFORMATION: /note= "Sal I restriction site." 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 45: 

GA CAG TTG ACT GTA TCG GTCGACTGTC G 
Arg Gin Leu Thr Val Sar 
1 5 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO:46: 

Arg Gin Leu Thr Val Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
CGACACCGCG GCCACGATGG TTTCTAAGCT GAGC 



(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 28 base pairs 
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(B) TYPE; nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 
CGACAGTCGA CCAACTTGTG CCGGAAGG 



(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6., 11 

(D) OTHER INFORMATION: /note= "Sac II restriction site," 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 12.. 16 

(D) OTHER INFORMATION: /note= "Kozak sequence." 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 17.. 3 4 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 17.. 34 

(D) OTHER INFORMATION: /note= "Region encoding N-terminal 
end of HNFl (1281) . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

CGACACCGCG GCCACC ATG GTT TCT AAG CTG AGC 

Met Val Ser Lys Leu Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_signal 

(B) LOCATION: 3 . .7 

(D) OTHER INFORMATION: /note= "Kozak sequence," 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1 . . 11 

(D) OTHER INFORMATION: /note= "Complementary to bases 5 to 
15 of SEQ ID NO: 54. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGCCACCATG C 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 amino acids 

(B) TYPE; amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 



(ix) FEATURE: 

(A) NAME/KEY; Peptide 

(B) LOCATION: 1..3 

(D) OTHER INFORMATION: /note= "Translation product of SEQ 

ID NO:53 and SEQ ID NO:55. Translat ional 
start site at base 8 of SEQ ID NO: 53." 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54; 

Met Leu Glu 
1 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 14. ,17 

(D) OTHER INFORMATION: /note= "Sac II restriction site overhang. 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Val Ser Lys Leu Ser 
1 5 



(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..20 

(D) OTHER INFORMATION: /note= "Region encoding for C-terminal end 

of HNFl (1-282) . " 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3 . . 17 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

CC TTC COG CAC AAG TTG GTCGACTGTC G 2 8 

Ala Phe Arg His Lys Leu 
15 



(2) INFORMATION FOR SEQ ID NO:.52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

Ala Phe Arg His Lys Leu 
1 5 



(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 



05/13/2005 02:29 FAX 617 951 7050 



ROPES GRAY 



[2]028/035 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1. .5 

(D) OTHER INFORMATION: /note= "Sal I restriction site overhang." 



(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 5. .27 

(D) OTHER INFORMATION: /note= "Complementary to SEQ ID NO: 60, 

bases 5 to 27 . " 

.(xi) SEQUENCE DESCRIPTION: SEQ ID NOiBS: 
TCGACCCTAA GAAGAAGAGA AAGGTAC 27 



(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
{ D ) TOPOLOGY : 1 inear 



(ix) FEATURE: 

(A) NAME/KEY: Peptide 

(B) LOCATION: 1 • ,11 

(D) OTHER INFORMATION: /note= "Translation product of SEQ ID 

NOS:58 and 60. " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Leu Asp Pro Lys Lys Lys Arg Lys Val Leu Glu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 basie pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 1.-5 

(D) OTHER INFORMATION: /note= "Xho I restriction site overhang," 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 
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(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 1..5 

(D) OTHER INFORMATION: /note= "Xho I restriction site overhang." 

(ix) FEATURE: 

(A) NAME/KEY: misc_f eature 

(B) LOCATION; 5.. 15 

(D) OTHER INFORMATION: /note= "Complementary to bases 1 to 11 of 

SEQ ID NO:53 . " 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
TCGAGCATGG TGGCCGC 17 

(2) INFORMATION FOR SEQ ID NO: 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
TCGACCCTAA GAMGAAGAGA AAGGTAC 27 

(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
. (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 
TCGAGTACCT TTCTCTTCKT CTTAGGG 2 7 

(2) INFORMATION FOR SEQ ID NO:58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: both 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(B) LOCATION: 5.. 27 

(D) OTHER INFORMATION: /note= "Complementary to SEQ ID NO: 58, 

bases 5 to 27 , " 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 60: 
TCGAGTACCT TTCTCTTCTT CTTAGGG ■ 27 

(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
CGACAGTCGA CGCCCCCCCG ACCGATGTC 29 

(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
CGACACTCGA GCCCACCGTA CTCGTC 26 



(2) INFORMATION FOR SEQ ID NO: 63: . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: misc_feature 

(B) LOCATION: 6.. 11 

(D) OTHER INFORMATION: /note= "Sal I restriction site, 
(ix) FEATURE: 
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(A) NAME /KEY: CDS 

(B) LOCATION: 12.. 29 



(ix) FEATURE : 

(A) NAME/KEY: misc_f eature 

(B) LOCATION: 12. .29 

(D) OTHER INFORMATION: /note= "Region encoding Nonterminal 
end of VP16 (413^^490) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 

CGACAGTCGA C GCC CCC CCG ACC GAT GTC 2 9 

Ala Pro Pro Thr Asp Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CH2\RACTERISTICS : 

(A) LENGTH: 6 amino acids 

(B) TYPE; amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:64:. 

Ala Pro Pro Thr Asp Val 
1 5 



(2) INFORMATION FOR SEQ ID NO: 65: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1 . . 15 



(ix) FEATURE: 

(A) NAME/KEY: misc^feature 

(B) LOCATION: 1. .15 

(D) OTHER INFORMATION; /note= "Region encoding C-terminal 
end of VP16 (413-490) 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 

GAC GAG TAG GGT GGG CTCGAGTGTC G 
Asp Glu Tyr Gly Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

Asp Glu Tyr Gly Gly 
1 5 



(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67 
GGAATTCCAT ATGGGCGTGC AGG 



(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

His Met Gly Val Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 9 base pairs 

(B) TYPE: nucleic acid 



05/13/2005 02:30 FAX 617 951 7050 



ROPES GRAY 



l2]032/035 



(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CTGTCCCGGG ANNNNNNNNN TTTCTTTCCA TCTTCAAGC 3 9 



(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70: 

I 

Arg Ser Xaa Xaa Xaa Lys Lys Gly Asp Glu Leu 
1 5 10 



(2) INFORMATION FOR SEQ ID NO: 71; 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 64 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 
CTGTCCCGGG AGGAATCAAA TTTCTTTCCA TCTTCAAGCA TNNNNNNNNN GTGCACCACG 60 
CAGG 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72: 

Arg Ser Ser Asp Phe Lys Lys Gly Asp Glu Leu Met Xaa Xaa Xaa His 
15 10 15 

Val Val Cys 



05/13/2005 02:30 FAX 617 951 7050 



ROPES GRAY 



©033/035 



(2) INFORMATION FOR SEQ ID NO: 73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingl e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 
CGCGGATCCT CATTCCAGTT TTAGAAGCTC CACATCNNNN NNNNNAGTGG CATGTGG 57 



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

Glu Leu Lys Leu Leu Glu Val Asp Xaa Xaa Xaa Thr Ala His Pro 
15 10 15 



(2) INFORMATION FOR SEQ ID NO: 75: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 
CGCGGATCCT CATTCCAGTT TTAGAAGC 



(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

Glu Leu Lys Leu Leu 

1 5 ' 



(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 77 
CGACAGTCGA CCGATACAGT CAACTGTC 



(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78 
CGACAGTCGA CCAACTTGTG CCGGAAGG 



(2) INFORMATION FOR SEQ ID NO: 79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79 
TCGAGCATGG TGGCCGC 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



005 02:31 FAX 617 951 7050 ROPES GRAY 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80 
TCGAGTACCT TTCTCTTCTT CTTAGGG 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA 

(xij SEQUENCE DESCRIPTION: SEQ ID NO: 81 
CGACACTCGA GCCCACCGTA CTCGTC 



